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Abstract 



Chandrasekaran, Chertkov, Gamarnik, Shah, and Shin recently proved 
that the average number of independent sets of random regular graphs of 
size n and degree 3 approaches w" for large n, where w « 1.54563, consis- 
tent with the Bethe approximation. They also made the surprising conjecture 
that the fluctuations of the logarithm of the number of independent sets were 
only 0(1) as n grew large, which would mean that the Bethe approximation 
is amazingly accurate for all 3-regular graphs. Here, I provide numerical 
evidence supporting this conjecture obtained from exact counts of indepen- 
dent sets using binary decision diagrams. I also provide numerical evidence 
that supports the novel conjectures that the number of kernels of 3-regular 
graphs of size n is given by y", where y 1.299, and that the fluctuations in 
the logarithm of the number of kernels is also only 0(1). 



1 Introduction 



In this paper I consider the problem of counting the number of independent sets 
and kernels of regular graphs. For the purposes of this paper, a graph will be 
defined as a collection of vertices and a collection of edges that connect pairs 
of vertices. Simple graphs are graphs with no more than one edge between any 
two vertices, and with no edges that connect a vertex to itself. The degree of a 
vertex is the number of edges connected to that vertex. A k-regular graph is a 
simple graph in which each vertex has degree k. An independent set is a set of 
vertices in a graph, no two of which are connected by an edge. A kernel, also 
called a "maximal independent set," is an independent set such that adding any 
other vertex to the set forces the set to contain a pair of vertices connected by an 
edge. 

Independent sets are closely related to "hard sphere" models that physicists 
use to model liquids and gases. In a hard sphere model, particles never overlap. 
Independent sets could therefore be seen as the legal positions of particles on 
a lattice, with no two particles being adjacent. In physics, the number of legal 
configurations of a hard sphere model is known as the "partition function," and 
the logarithm of that number is known as the entropy of the model. 

In a recent paper [1], Chandrasekaran et al. proved that the average number of 
independent sets for 3-regular graphs of size n will approach w n as n grows large, 
where w ~ 1.54563. This value was computed using the Bethe approximation 
from statistical physics 0. They also made a very surprising prediction about the 
fluctuations around this result. Ordinarily, the fluctuations between random sam- 
ples of similar systems will grow as y/n, where n is the size of the system. Instead, 
Chandrasekaran et al. conjectured that the standard deviation of the logarithm of 
the number of independent sets of random regular graphs will not be 0(y/n), as 
one might expect, but instead be 0(1) ! This implies that the Bethe approximation 
will always provide an amazingly accurate estimate for the entropy of independent 
sets for every randomly chosen 3-regular graph. 

Chandrasekaran et al. proved that their surprising conjecture is true if the 
Shortest Cycle Cover Conjecture (SCCC) of Alon and Tarsi [3] is true, but they 
offered no direct numerical evidence. This is most likely due to the difficulty of 
actually counting independent sets for large graphs. Counting independent sets, 
even for 3-regular graphs, is a #P-hard problem flU, meaning that the time it takes 
to count independent sets will grow exponentially as the graph sizes grow. Well- 
suited, however, to counting solutions to combinatorial problems is the binary 
decision diagram (BDD), first introduced by Bryant [5], and recently explicated 
by Knuth fl6]. In my paper, I use BDDs to gather numerical evidence that con- 
vincingly confirms the conjecture of [Q]|. 
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Figure 1 : A regular 6-graph. 



Chandrasekaran et al. did not make any predictions about kernels. We can still 
use the BDDs to gather evidence about kernels, however, and the evidence shows 
that kernels behave very similarly to independent sets. More precisely, I make the 
novel conjectures that the average number of kernels of 3-regular graphs grows as 
y", with y w 1.299, and that the fluctuations in the logarithm of that number are 
only 0(1) as n grows large. 

2 Independent Sets and Kernels 

In this section I will give a more detailed explanation of independent sets. Recall 
that an independent set is defined as a set of vertices in a graph, no two of which 
are connected by an edge. 

Figure Q] shows a 3-regular graph of six vertices (or "6-graph"). The indepen- 
dent sets of this graph would be {0, {1}, {2}, {3}, {4}, {5}, {6}, {1, 3}, {1, 
5}, {2, 4}, {2, 5}, {3, 6}, {4, 6}}, because those are the thirteen possible sets of 
vertices such that no two of them will be connected. 

Change the figure to another 6-graph of degree 3, and the independent sets 
change as well, as in figure [2l This graph's independent sets are {0, {1}, {2}, 
{3}, {4}, {5}, {6}, {1, 3}, {1, 5}, {2, 4}, {2, 6}, {3, 5}, {4, 6}, {1, 3, 5}, {2, 4, 
6}}. There are fifteen configurations here; this shows that there can be variance 
in the number of independent sets even in graphs of identical size and degree. 

Recall now that a kernel is a "maximal independent set" or an independent set 
to which one cannot add a vertex without also adding an edge. Although there 
were 13 independent sets for the graph in figure [H there are only 6 kernels: they 
are {{1, 3}, {1, 5}, {2, 4}, {2, 5}, {3, 6}, {4, 6}}. The graph in figure [2] contains 
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Figure 2: A different regular 6-graph. 



15 independent sets, but there are only two kernels: {{1, 3, 5}, {2, 4, 6}}. This 
shows that more independent sets does not necessarily translate into more kernels, 
and also shows that there is perhaps more variance in the number of kernels than in 
the number of independent sets. However, this paper will show that as the number 
of vertices grows, the number of kernels and independent sets will actually behave 
very similarly. 

Kernels and independent sets can also be thought of as binary functions of 
ones and zeroes. This is done by assigning each vertex of a graph a value of either 
or 1. When checking if a possible configuration of O's and l's is an independent 
set or kernel, one considers the vertices included in the set to have value 1, and 
those that are excluded to have value 0. The binary function for an independent 
set (or kernel) has a value 1 if the configuration corresponds to an independent set 
(or kernel), and otherwise. 

3 Binary Decision Diagrams 

Binary decision diagrams (BDDs) provide compact representations of binary func- 
tions [EJ [[6ll ; in our case the binary functions represent independent sets and ker- 
nels. Because BDDs are the source of all of the numerical evidence in this paper, 
it is essential that the paper contain an adequate explanation of them. 

Although graphs and BDDs look similar to each other, they serve quite differ- 
ent purposes. A BDD is composed of nodes and links between those nodes, only 
now the links "flow" in a particular direction and the relationship between the 
links and the nodes is more complicated than in a graph. Each node has a value, 
denoted V, a L0 branch, which "points" to another node, and a HI branch, which 
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Figure 3: A BDD for the majority function. 



also points to another node. Each node's (LO, HI) combination must be unique 
for it to be a true binary decision diagram, and at each node, LO ^ HI. At each 
node, V describes the variable on which the decision depends. For example, in a 
graph of size n as described above, it is often convenient to number the vertices of 
the graph 1 , 2, 3, ...,n. So if we were to use a BDD to describe the binary function 
corresponding to independent sets, a node with V = x would depend on the vertex 
numbered x in the graph. The LO and HI branches of this node would point to 
other nodes; the idea is that if the vertex numbered x had a value of 1, one should 
take the HI branch to the next node, and if it was equal to 0, one should take the LO 
branch. These nodes will eventually point to two "sinks," True and False. The 
sink one reaches by going down the tree will determine whether the path you have 
taken corresponds to the binary function having a value of 1 (True) or (False). 

This idea is best explained with an example. Let us suppose we have three 
binary variables, x\, xi, and X3, and let us suppose our binary function is the 
"majority function" which has value 1 if and only if two or more of the three 
variables have value 1. The BDD for this problem would look like figure [3] 

As we look at this BDD, we consider first the top node. In BDDs, a solid line 
denotes the HI path and a dotted line the LO path. Let us assume that x\ = 1 and 
therefore we take the HI path, to the leftmost 2 node. Here, we see that if we take 
the HI path again, we go directly to the True sink, without even considering 3. 
This is because once we know that both x\ and X2 equal 1, we already know that 
the majority function equals 1 — it doesn't matter what X3 is. In fact, it would be 
incorrect to add the redundant extra node: we stated earlier that no node in a BDD 
can have LO = HI, and both of the extra node's branches would point to True. 

Let us go back to the leftmost 2 node. If we choose the LO path, then that 
means x\ = 1 and X2 = 0. That means that for the majority function to equal 1, X3 
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must equal 1. That is why x^'s HI branch points to True and its LO branch points 
to False: this final variable decides the value of the majority function. 

Now we consider a more complex BDD. Figure|4]is the BDD for the indepen- 
dent sets of the first graph we looked at. Extra False sinks have been added for a 
clearer picture. 

This BDD is complicated, but one can still recognize some patterns. For exam- 
ple, one can only reach False sinks by taking a HI branch-this makes sense, since 
removing a vertex from an independent set always yields another independent set. 

The main use of BDDs such as this in this paper, however, is not to be read 
by humans, but to be read by computer. We exploit the fact that there exist algo- 
rithms to systematically construct the BDD for the independent sets and kernels 
of a graph Moreover, given a BDD, it is straightforward to exactly count the 
number of solutions of the binary function it represents, which in our case corre- 
sponds to the number of independent sets (or kernels). The counting algorithm H 
works as follows, where s is the total number of nodes in the BDD, counting the 
True and False nodes as one node each, and Vk, h, and % are V, LO, and HI for 
the &th node. 

• Step 1: [Loop over k]. Set cq <— 0, c\ <— 1, and do Step 2 for k = 2, 3, ...,s — 
1. Then return the answer 2 Vs ~ l c s -\. 

• Step 2: [Compute c k ]. Set l<-l h h<- h k , c k <- 2 v ' _v * _1 q + 2 v '> _v * _1 c/ ! . 

Using this algorithm, it is possible to quickly and efficiently count solutions to 
independent sets of reasonably small graphs. This is what I did for graphs of 
degree 3 and sizes 6 to 40 (even numbers only, because it is impossible to have a 
graph of odd degree and odd size). My data convincingly confirm the conjecture 
of Chandrasekaran et al. for independent sets, and lead to similar conjectures for 
kernels. 

4 Numerical evidence 

To create the data presented in this section, I generated 1000 random 3-regular 
simple graphs of each even size between 6 and 40. It is easy to do this using an 
algorithm that randomly adds edges between vertices that still have fewer than 
three edges, and that have not previously been connected. For each graph I then 
created a BDD by automatically generating, for that graph, appropriate input for 
D. E. Knuth's BDD creation program Q written in his "BDD language." The 
above counting algorithm was used to exactly count the number of solutions of 
each BDD. 
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Figure 4: A BDD representing the independent sets of a 3-regular 6-graph. 
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For both independent sets and kernels, the BDDs were created using a boolean 
function that was a large AND function of a collection of local functions. For the 
independent set case, each local function required that the variables correspond- 
ing to the two vertices on an edge were not both 1 . For the kernel case, the local 
functions required for each vertex variable that if it was was 1, all its neighbors' 
vertex variables were 0, and if it was 0, at least one of its neighbors' vertex vari- 
ables was 1 . This last condition corresponds to the requirement that one cannot 
add a vertex to a maximal independent set and have it remain an independent set. 

Knuth's program records the number of memory accesses it makes as it creates 
a BDD. Memory accesses in modern computers dominate the running time, so 
they serve as a good proxy for computational complexity. I found that the average 
number of memory accesses to create a BDD for the independent sets of a 3- 
regular graph of size n grew roughly as 400 x 1 .28". Because of the exponential 
growth in the complexity, BDD's, like any other algorithm for exact counting of 
independent sets and kernels, are limited to relatively small n. 

4.1 Independent sets 

Chandrasekaran et al. prove that at n grows large, the average number of inde- 
pendent sets of a 3-regular w-graph will approach w", where w = z _3//2 (2 — z)~ 1//2 
and z is a root of the equation z 3 + z — 1 = 0, giving w « 1.545634155. 

Numerically, we can estimate w for any n as w est = exp ( meaw ) , where mean 
is the numerically determined mean of the number of counts. Figure \5\ presents a 
table that shows that even using graphs of size n = 40 or less, we could numeri- 
cally estimate w accurately to three significant figures if we did not know its exact 
value. Note that the estimate of w seems to be approaching its ultimate exact value 
from below. 

Chandrasekaran et al.'s conjecture about fluctuations was precisely stated as 
follows in their Theorem 1 1 HI: 

"Let G be chosen uniformly at random among all 3-regular graphs with n 
vertices. Assuming SCCC is true, there exists a function f : (0, 1) — > IR + , so that 
| InZ - lnZ g | < /(e) with probability 1 - £, where \ lnZ g « In 1 .545." 

Here Z is the number of independent sets, Zb is the Bethe approximation to 
that number, and "SCCC" is the "Shortest Cycle Cover Conjecture," due to Alon 
and Tarsi (3l 0], which states "Given a bridgeless graph G with m edges, all of its 
edges can be covered by a collection of cycles with the sum of their lengths being 
at most 7m/ 5 = 1.4m." 

Chandrasekaran et al.'s Theorem 11 means that for any probability 1 — £, the 
fluctuations in the number of independent sets of regular 3-graphs will not be more 
than /(e) . Since /(e) does not depend on the size of the graph, that means that 
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n 


w n 


mean 


In mean 

r v est *" 


6 


13.635 


13.464 


1.5423952668 


8 


32.573 


31.815 


1.54109350802 


10 


77.815 


75.777 


1.54153624619 


12 


185.9005 


181.494 


1.54254741637 


14 


444.1134 


434.487 


1.54321669622 


16 


1060.980 


1041.904 


1.54388245415 


18 


2534.665 


2485.237 


1.54394400334 


20 


6055.279 


5930.353 


1.5440239311 


22 


14465.97 


14191.04 


1.54428663307 


24 


34558.98 


33960.44 


1.54450939167 


26 


82560.89 


81049.27 


1.54453602897 


28 


197236.7 


193795.5 


1.54466285137 


30 


471195.6 


462317.9 


1.54465451307 


32 


1125679 


1106305 


1.54479583718 


34 


2689230 


2639377 


1.54478373281 


36 


6424531 


6313624 


1.54488668558 


38 


15348108 


15109601 


1.54499725062 


40 


36666398 


36075768 


1.54500677979 



Figure 5: Numerical results for the number of independent sets in random regular 
graphs, compared with the Bethe approximation estimate of flU, which says that 
the mean should approach w n , with w ~ 1.54563, for large n. 
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0.00 

0.0 0.2 0.4 0.6 0.8 1.0 
Figure 6: Estimate of /(e) for independent sets of 3-regular 6-graphs. 

the fluctuations are 0(1). The data that follows in this section is meant to test 
this claim (which depends on the unproven SCCC), by numerically finding the 
function /(e). 

The approach I take is to plot a numerical estimate /(e), by computing the 
difference | InZ — lnZg| for each graph, and then finding the probability 1 — e that 
the difference has a particular value for each n. Consider for example the plot 
shown in figure [6l which is for the n = 6 case. This graph only contains two val- 
ues on the vertical axis: one at approximately 0.0954, the other at approximately 
0.0476. This is because, as was mentioned before, there can only be 13 or 15 in- 
dependent sets of a regular 6-graph of degree 3, and those are the differences one 
finds (in the logarithm of the number) with respect to the Bethe approximation of 
approximately 13.635. As the number of variables becomes larger, however, the 
number of "levels" in the graph will also increase. For example, figure [7] shows 
the estimate of /(e) for regular 8-graphs. 

The value of /(0), which is the largest difference found between the true log- 
arithm of the number of independent sets and the Bethe approximation, increased 
from approximately .0954 to approximately .2646. This trend, however, will not 
continue, substantiating the prediction of Chandrasekaran et al. The largest dif- 
ference /(0) actually drops between 8 and 10, and will stabilize as the sizes get 
larger. So will the estimate for /(e), for general e, which stops looking like a 
series of step function and start taking on a smoother shape. Figure [8] is actually 
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0.25 
0.20 
0.15 
0.10 

0.05 L — : — ^ 
0.00 

0.0 0.2 0.4 0.6 0.8 1.0 
Figure 7: Estimate of /(e) for independent sets of 3-regular 8-graphs. 
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Figure 8: Estimates of /(e) for independent sets of 3-regular ^-graphs, with n = 
10,12,14,16. 
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Figure 9: Estimates of /(e) for independent sets of 3-regular n-graphs, with n 
taking all even values between 18 and 40, inclusive. 

four plots, for 3-regular graphs of sizes 10, 12, 14, and 16, superposed. 

Figure |9] shows the data for the remaining 12 plots, all superposed. Their sizes 
are comprised of the even numbers between 18 and 40, inclusive. It is difficult to 
believe that the figure is twelve different sets of data. Not only do the fluctuations 
not grow beyond some upper limit (which is all that is necessary for the Chan- 
drasekaran et al.'s conjecture to be true); they hardly change at all! This means we 
can confidently estimate the expected size of the fluctuations in the entropy from 
our numerical data. 

To emphasize how unusual the behavior shown in figure [9] is, I will present a 
similar set of data for random graphs that are selected to have the same size and 
the same average degree of 3, but are not necessarily regular. An example of a 
non-regular random 6-graph with average degree 3 is shown in figure [T0l This 
graph's independent sets are {0, {1}, {2}, {3}, {4}, {5}, {6}, {1, 3}, {1, 4}, {2, 
3}, {2, 6}, {3, 4}, {4, 6}, {1, 3, 4}}, so it has 14 independent sets. This graph 
was not previously possible, of course, because vertices 3 and 4 only have two 
edges, and vertex 5 has five. 

For the class of rc-graphs with average degree 3, the average number of in- 
dependent sets is somewhat larger than it is for random 3-regular graphs. I find 
that the average number of independent sets grows as x n , where x ~ 1.594. For 
the fluctuations, one can measure a function /(e) defined by /(e) = | InZ — n lnx| . 
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Figure 10: A 6-graph with average degree 3, that is not 3-regular. 



1.00 
0.80 
0.60 
0.40 
0.20 
0.00 

0.0 0.2 0.4 0.6 0.8 1.0 

Figure 1 1 : Estimates of /(e) for independent sets of n-graphs with average degree 
3, with n taking all even values between 10 and 36, inclusive. The fluctuations 
consistently grow as n increases. 
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n 


mean 


In mean 

yest = e " 


8 


7.941 


1.29564015538 


10 


14.437 


1.30601358862 


12 


23.420 


1.30056553464 


14 


39.822 


1.30105155128 


16 


66.855 


1.30038175746 


18 


112.229 


1.29985445627 


20 


189.283 


1.29973729397 


22 


321.368 


1.30003386341 


24 


540.124 


1.29973224901 


26 


904.901 


1.29931791247 


28 


1516.237 


1.29896911345 


30 


2581.067 


1.29935147154 


32 


4333.530 


1.29912609539 


34 


7308.847 


1.29910009294 


36 


12285.019 


1.29895400448 


38 


20694.544 


1.29889831749 


40 


34996.192 


1.29897481351 



Figure 12: Table showing the mean number of kernels, averaged over 1000 3- 
regular n-graphs for each value of n, and the numerical estimate for y, where the 
average is given by y n . 

Figure[TT]shows the estimated /(£), obtained in the same way as the regular graph 
data, for the random graphs with average degree 3, and for n between 10 and 36. 
Here, the fluctuations are clearly increasing with n, as one would expect. 

4.2 Kernels 

Next we look at the data for kernels. First, I find that the average number of 
kernels for 3-regular n-graphs is approximately equal to y n , with y rs 1.299. This 
value of y can be read off from the table presented in figure [[21 Notice that y 
actually seems to reach its ultimate value more quickly than w did. 

For the plots of the fluctuations, one can estimate a function /(e) analogous to 
the function for the independent sets using /(e) = | InZ — nlny|, with y = 1.299. 
With that in mind, figure [13] shows the estimated function /(e) for kernels of 3- 
regular rc-graphs, with n ranging over even numbers from 6 to 16 inclusive, while 
figure [141 shows /(e) for n ranging from 18 to 40. 

The estimated function /(e) which measures fluctuations for kernels looks 
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1.00 



0.80 
0.60 
0.40 
0.20 
0.00 

0.0 0.2 0.4 0.6 0.8 1.0 

Figure 13: Estimates of /(e) for kernels of 3-regular n-graphs, with n = 
6,8,10,12,14,16. 
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0.60 
0.40 
0.20 
0.00 

0.0 0.2 0.4 0.6 0.8 1.0 

Figure 14: Estimates of /(e) for kernels of 3-regular n-graphs, with n taking all 
even values from 18 to 40, inclusive. 
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similar to that for independent sets, albeit with larger fluctuations. This is not so 
surprising, if one recalls that regular 6-graphs have either 2 or 6 kernels, while they 
have 13 or 15 independent sets. Comparing figure [141 for kernels with figure[9]for 
independent sets, we see that the fluctuations in the entropy are nearly four times 
as large for kernels as independent sets. It is also clear, however, that numerically, 
kernels and independent sets share the essential property that their fluctuations do 
not grow as the graph size increases. I thus conjecture that the fluctuations in the 
logarithm of the number of kernels in 3-regular n-graphs will only be 0(1) as n 
grows large. 

The strong similarity between the numerical results for kernels and indepen- 
dent sets suggests that a Bethe approximation [2J could give a highly accurate 
result for the number of kernels. However, performing such a calculation turns 
out to be considerably more intricate for the case of kernels than it was for in- 
dependent sets, because the binary function representing configurations that are 
kernels is the AND of local functions of vertex variables that involve a vertex and 
all its neighbors (e.g. four variables in the case of 3-regular graphs), while for 
independent sets the local functions only involve simple pairs of vertices. I hope 
to report on the results of such a calculation in the near future. 
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